Similarity Measures for Relational Databases

Authors

  • Melita Hajdinjak
  • Andrej Bauer
Abstract

We enrich sets with an integrated notion of similarity, measured in a (complete) lattice, special cases of which are reflexive sets and bounded metric spaces. Relations and the basic relational operations of traditional relational algebra are interpreted in such richer structured environments. A canonical similarity measure between relations is introduced. In the special case of reflexive sets it is just the well-known Egli-Milner ordering, while in the case of bounded metric spaces it is the Hausdorff metric. Some examples of how to perform approximate searches (e.g., similarity search and relaxed answers) are given.
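The two special cases named in the abstract can be illustrated with a minimal sketch. It treats relations as finite collections of tuples and uses the textbook definitions of the Hausdorff distance and the Egli-Milner ordering; the paper's own lattice-valued construction is more general and may differ in detail. The names `BOUND`, `bounded_distance`, `hausdorff`, and `egli_milner_leq`, as well as the choice of a capped Chebyshev distance, are assumptions made for illustration only.

```python
from typing import Callable, Sequence, Tuple, TypeVar

T = TypeVar("T")
Row = Tuple[float, ...]

# Assumed upper bound of the metric (illustrative; any positive constant works).
BOUND = 1.0


def bounded_distance(x: Row, y: Row) -> float:
    """Chebyshev distance between two rows, capped at BOUND so the metric is bounded."""
    raw = max(abs(a - b) for a, b in zip(x, y))
    return min(raw, BOUND)


def hausdorff(r: Sequence[Row], s: Sequence[Row]) -> float:
    """Hausdorff distance between two finite relations under bounded_distance."""
    if not r and not s:
        return 0.0
    if not r or not s:
        # Convention assumed here: an empty relation is maximally far from a non-empty one.
        return BOUND
    # Largest distance from a row of r to its nearest row in s, and vice versa.
    forward = max(min(bounded_distance(x, y) for y in s) for x in r)
    backward = max(min(bounded_distance(x, y) for x in r) for y in s)
    return max(forward, backward)


def egli_milner_leq(r: Sequence[T], s: Sequence[T], leq: Callable[[T, T], bool]) -> bool:
    """Egli-Milner ordering: every element of r lies below some element of s,
    and every element of s lies above some element of r."""
    lower = all(any(leq(x, y) for y in s) for x in r)
    upper = all(any(leq(x, y) for x in r) for y in s)
    return lower and upper


if __name__ == "__main__":
    r = [(0.0, 0.0), (0.5, 0.5)]
    s = [(0.0, 0.1), (0.5, 0.5)]
    print(hausdorff(r, s))
    print(egli_milner_leq([1, 2], [2, 3], lambda a, b: a <= b))
```

In this reading, a similarity search amounts to keeping the stored relations whose Hausdorff distance to a query relation falls under a chosen threshold.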


Similar resources

Using Ontologies to Improve Answer Quality in Databases

Title of dissertation: USING ONTOLOGIES TO IMPROVE ANSWER QUALITY IN DATABASES Yu Deng, Doctor of Philosophy, 2006 Dissertation directed by: Professor V.S. Subrahmanian Department of Computer Science One of the known shortcomings of relational and XML databases is that they overlook the semantics of terms when answering queries. Ontologies constitute a useful tool to convey the semantics of ter...


Logical Foundations for Similarity-Based Databases

Extensions of relational databases which aim at utilizing various aspects of similarity and imprecision in data processing are widespread in the literature. A need for development of solid foundations for such extensions, sometimes called similarity-based relational databases, has repeatedly been emphasized by leading database experts. This paper argues that, contrary to what may be perceived f...


Using relational databases for improved sequence similarity searching and large-scale genomic analyses.

Relational databases are designed to integrate diverse types of information and manage large sets of search results, greatly simplifying genome-scale analyses. Relational databases are essential for management and analysis of large-scale sequence analyses, and can also be used to improve the statistical significance of similarity searches by focusing on subsets of sequence libraries most likely...


Similarity-based learning on structures

The seminar centered around different aspects of similarity-based clustering with the special focus on structures. This included theoretical foundations, new algorithms, innovative applications, and future challenges for the field. For finding the structure in the data set's smothers many tools are related like sisters and brothers. We conclude in the sequel: All methods are equal! (But some are mor...


Investigating the Applicability of Centrality Measures as Indicators of Citation Relationships between Documents in Relational Information Retrieval: A Preliminary Study

Purpose: This pilot study investigates the correlation between centrality measures and bibliographic coupling, a well-known citation-based document similarity measure. Methodology: Using the citation analysis method, 40 research articles belonging to four engineering/pure disciplines (Physics, Chemistry, Biology, and Computer) and four Humanities and Social disciplines (Economics, Edu...



Journal title:
  • Informatica (Slovenia)

Volume 33  Issue 

Pages  -

Publication date 2009